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Abstract: Simple sequence repeats (SSRs, also known as microsatellites) are known to be 
mutational hotspots in genomes. DNA rearrangements have also been reported to 
accompany allopolyploidization. A study of the effect of allopolyploidization on SSR 
mutation is therefore important for understanding the origin and evolutionary dynamics 
of SSRs in allopolyploids. Three synthesized double haploid (SynDH) populations were 
made from 241 interspecific Fi haploid hybrids between Triticum turgidum L. and 
Aegilops tauschii (Coss.) through spontaneous chromosome doubling via unreduced 
gametes. Mutation events were studied at 160 SSR loci in the Si generation (the first 
generation after chromosome doubling) of the three SynDH populations. Of the 148260 
SSR alleles investigated in Si generation, only one mutation (changed number of repeats) 
was confirmed with a mutation rate of 6.74 x 10~ 6 . This mutation most likely occurred in 
the respective Fi hybrid. In comparison with previously reported data, our results suggested 
that allohexaploidization of wheat did not increase SSR mutation rate. 
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1. Introduction 

Simple sequence repeats (SSRs, also known as microsatellites) are a class of DNA that is composed 
of short tandem repeats of a basic motif of one to six nucleotides. SSRs are ubiquitous in genomes. 
The nature of motif repetition makes SSRs inherently instable and prone to mutation by mechanisms of 
replication slippage or unequal crossing over [1]. Because of their high mutability, SSRs are thought to 
play a significant role in genome evolution [2]. Meanwhile, their abundance and high polymorphism 
make SSRs a popular molecular marker system in genetic studies and breeding programs as well [3]. 

Since it is a critical parameter in models of population genetics, the SSR mutation rate has been 
estimated in numerous studies [1,4]. The estimated average SSR mutation rate per allele and per 
generation varied greatly among species. For example, the rate was estimated as 10~ 6 in fruit fly [5], 
10 in human [6] or about 10 -10 in crops like durum wheat [7], common wheat [4], and maize [8]. 
Exposed to external stresses such as irradiation, oxidative damage, in vitro culture etc. will increase 
SSR mutation rates [2]. 

Allopolyploidization is an important evolutionary process that merges two distinct genomes into the 
nucleus to form an allopolyploid species. This process is particular important to higher plants. The 
merger of two distinct genomes presents a "genomic shock", as McClintock [9] described, to which 
plants respond with a variety of genomic changes. DNA sequence rearrangements accompanied with 
allopolyploidization have been reported in various allopolyploid plant models, such as wheat [10-13], 
Brassica [14-18], Triticale [19-21] and Arabidopsis [22,23]. Allopolyploidization involves two steps: 
interspecific hybridization and chromosome doubling. It is interesting to know whether the two steps 
can accelerate SSR mutations. Such a study, however, is difficult since the sample size of independent 
allopolyploidization events needs to be large enough to reach a sufficient accuracy in detection of 
mutation events and conducting the two steps in experimental condition is very labor- and 
time-consuming. Utilizing genes that promote formation of unreduced gametes in haploids is a way to 
facilitate such a study [24]. 

In the present study we investigated SSR mutation events that existed in the 241 Si (the first 
generation after chromosome doubling) lines from three synthesized double haploid (SynDH) 
populations. Each of these SynDH lines was derived from an independently obtained interspecific Fi 
haploid hybrid between Triticum turgidum L. and Aegilops tauschii Coss (Table 1; Figure 1). These 
241 SynDH hexaploid wheat lines represent 241 independent allopolyploidization events and therefore, 
are excellent materials for estimating SSR mutation rate during this important evolutionary event. 
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Table 1. Numbers of simple sequence repeats (SSR) markers scored and their estimated 
alleles in three populations. 



Double haploid 


Hybrids of haploid (triploid) F x hybrids 


No. S x 


No. of 


No. of 


populations 


Female 


Male 


lines 


scored 


estimated 


(hybrid combinations) 






analyzed * 


markers ** 


Si alleles 


SynDHl 


Fi hybrids between 


Ae. tauschii AS 60 


113 


81 


93790 


(LDN/AS313//AS60) 


T. turgidum LDN 
and AS313 










SynDH2 


Fi hybrids between 


Ae. tauschii AS 60 


89 


12 


11570 


(LDN/AS2255//AS60) 


T. turgidum LDN 
and AS2255 










SynDH3 (PI377655// 


T. turgidum 


hybrids between 


39 


79 


42900 


AS66/AS87) 


PI377655 


Ae. tauschii AS 66 and 
AS87 








Total 






241 


160 


148260 



* Each of these Si SynDH lines was derived from an independently obtained interspecific Fi haploid hybrid between 
Triticum turgidum L. and Aegilops tauschii Coss. A bulked DNA sample from five randomly selected Si plants per 
SynDH line was used in investigation of SSR mutation events. The five Si plants from each SynDH line are expected to 
be genetically identical since they were derived from a single Fi hybrid; ** Of the 81 markers used in SynDHl, 12 were 
also used in SynDH2. Therefore, a total of 160 SSR markers were used. 



Figure 1. The production and pedigree of synthetic double haploid population SynDH3 
used in this study. Ae\ Aegilops; T: Triticum; SynDH: synthesized double haploid. 

Ae. tauschii AS66 X Ae. tauschii AS87 
(2n=2x=14, DD) J (2n=2x=14, DD) 

T. turgidum PI377655 X F x hybrids 
(2n=4x=28, AABB) j (2n=2x=14, DD) 

39 Interspecific F t hybrids 

(n=3x=21,ABD) 

/ ■ 

j Selfing & Spontaneous \ 

\ chromosome doubling j 

• • : 

Si seeds of 39 SynDH lines 
(2n=6x=42, AABBDD) 

Bulk of 5 Si plants for each of 39 SynDH lines 

2. Results and Discussion 

SSR mutations are predominantly manifested in changing numbers of the repeat motif. Hence, our 
investigation of SSR mutants and thus the estimation of SSR mutation rate were focused on variation 
of the repeat numbers at each SSR locus using bulked DNA samples from five randomly selected Si 
plants per SynDH line. The five Si plants from each SynDH line are expected to be genetically 
identical since they were derived from a single Fi hybrid (Figure 1). 
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A total of 160 SSR markers that were polymorphic between the parents were used in this study 
(Table 1). Of them, 148 were mapped to particular loci of the SynDH lines (Table SI). The 
chromosome locations of the remaining 12 were determined based on the information from 
GrainGenes database. Of the three SynDH populations, a total of 148260 SSR alleles were recognized 
and investigated. As expected, the amplification pattern in a SynDH Si line was a combination of its 
parents' for 158 markers (Figure SI showing an example image for marker Xbarcl37). However, the 
expected SSR patterns from T. turgidum parents for marker Xcfd4 (Figure 2) and from Ae. tauschii for 
marker Xwmc312 (Figure 3) were lost in the analyzed SynDHl lines. However, the losses were also 
observed in the amplification patterns of DNA mixtures of its parents. Apparently, the observed losses 
did not result from SSR mutations but most likely from a competition for PCR amplification between 
the parental homoeo-alleles. 

Figure 2. SSR amplification patterns of marker Xcfd4. The samples were two of SynDHl 
lines (1, 2); DNA mixture of their three parents (AS313, LDN and AS60) with a ratio of 
1:1:1 (3); Aegilops tauschii line AS60 (4); Triticum turgidum parents LDN (5) and 
AS313(6). 



2 3 4 5 6 




Figure 3. SSR amplification patterns of marker Xwmc312. Compared to their parents AS60 
(1); LDN (2) and AS313 (3); Si (lane SI) and S 3 (lane S3) of SynDHl line 61 showed 
novel SSR patterns. AS313 and AS60 had same SSR pattern. SSRs from AS60 (1) were 
absent in SynDHl line 61 (SI and S3) and these SynDHl lines carrying LDN allele. The 
SSRs in other SynDHl lines should be the products of AS313 allele (3) rather than AS60 
(1) due to a competition for PCR amplification. 



SynDHl Lines 




SSR gain was only observed in the line 61 of SynDHl population at locus Xwmc312. To make sure 
that this gain mutation is true, we repeated the PCR assay eight times and then assayed the respective 
S3 plants eight times. We were able to confirm the observation in each of these assays (Figures 3,4), 
suggesting that this SSR mutation was homozygous in the Si and has been stably transmitted into the 
offspring. This SSR mutation most likely took place in the respective haploid Fi and then homologized 
in the Si via the union of unreduced gametes. SSR marker Xwmc312 on 1A was mapped with the 
SynDHl population and was found to be flanked with SSR markers Xbarcl48, Xgwm357 and 
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Xwmc83, high-molecular- weight glutenin subunit Glu-Al, and Diversity Arrays Technology (DArT) 
loci wPt-4658, wPt-5316 and wPt-8016 (data not shown). Since all these flanking loci in SynDHl line 
61 were derived from LDN, this novel SSR was apparently mutated from the LDN allele. 

Figure 4. SSR amplification patterns of marker Xwmc312. Compared with their parents 
AS60, Langdon and AS313, SI and S3 of SynDHl line 61 showed novel SSR patterns. 
The 6% polyacrylamide gel used in this study that can detect 1 bp difference between 110 
and 1 1 1 bp products. 




This novel SSR was confirmed by analyzing the sequences of the Xwmc312 alleles cloned from 
SynDHl line 61 and its parents. A total of 50 clones were sequenced and analyzed. All of the clones 
contained the same SSR flanking sequences of 173 bp and differed only in the number of the GA 
repeats they had. However, we failed to get a clean sequence for each line, which may be caused by 
PCR-generated stutter [4,7]. Based on the frequency of appearance of the SSR sequences for each line 
(Table 2), the approximate lengths of the Xwmc312 alleles of AS313, AS60, LDN, and SynDH line 61 
were estimated as 197 bp (12 repeats, Genbank number JQ404445), 197 bp (12 repeats, JQ404444), 
229 bp (28 repeats, JQ404446), and 215 bp (21 repeats, JQ404447), respectively (Figure 5). It seems 
that the novel allele in SynDHl line 61 was originated by deleting seven GA repeats from the 
LDN allele. 

Table 2. The distribution of SSR sequence length for marker Xwmc312. 



No. of Number of clones with different sequence length 



Materials 


sequenced 


193 


195 


197 


199 


213 


215 


217 


219 


223 


227 


229 


231 


233 




clones 


(bp) 


(bp) 


(bp) 


(bp) 


(bp) 


(bp) 


(bp) 


(bp) 


(bp) 


(bp) 


(bp) 


(bp) 


(bp) 


AS313 


8 




1 


7 






















Langdon 


9 


















1 


2 


4 


1 


1 


AS60 


9 


1 


1 


6 


1 




















SynDH 1-61 
(SI) 


13 












9 


3 


1 












SynDH 1-61 
(S2) 


11 






1 




3 


4 


3 
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Figure 5. Sequence alignment of the DNA sequence of the Xwmc312 alleles cloned from 
SynDHl-61 (Genbank number JQ404447), LDN (JQ404446), AS313 (JQ404445) and 
AS60 (JQ404444). The red rectangle with an arrow indicated forward primer region and 
the yellow rectangle with an arrow indicated reverse primer region. The diagonal 
represented the ellipsis sequences. 



0 10 20 150 160 170 180 190 200 210 220 
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The SSR mutation rate Si generation was estimated by dividing the number of the observed mutant 
allele (o) by the total number of possible mutant alleles (n). The n number for Si generation of each 
SynDH population was estimated based on the total observed SSR alleles at all of the loci (Table 1). 
The estimated average mutation rates were, therefore, 1/93790 = 1.06 x 10~ 5 for SynDHl Si and 
1/148260 = 6.74 x 10~ 6 for the three Si populations. The estimated SSR mutation rates were lower 
than the ca. 10~ 3 ~10~ 4 per allele per generation that were previously estimated in durum wheat [7] and 
common wheat [4]. Except for marker Xcfdl 61, repeat motif of markers used in this study has a length 
of at least 2 bp (Table SI). We used 6% polyacrylamide gels that can detect 1 bp difference between 
110 and 1 1 1 bp products (Figure 4). Given the PCR-stutter and the pooled screening with a bulk DNA 
of five plants for each line, however, we cannot exclude that the low mutation rate resulted from 
underestimated small variations of 1 or 2 bp. Anyway, this study did not provide evidence that 
allopolyploidization during resynthesizing hexaploid wheat increased SSR mutation rate, which 
was agreed with that SSR sequences are highly conserved during allopolyploidization of hexaploid 
wheat [25] and of Brassica [15]. 

As mentioned earlier in this communication, allopolyploidization-accompanied DNA sequence 
rearrangements have been reported in some allopolyploids. However, no evidence for sequence 
variation was found in re-synthesized cotton [26] and some allohexaploid wheats [25,27,28]. These 
different conclusions may be a result of different genome combinations and/or different DNA types 
revealed by different technical methods. The effects of genome combination on inducing variations 
have shown by that hybrids of wheat with rye have higher rate of DNA sequence changes than hybrids 
of wheat with other related species [19,20]. This might be due to the fact that rye is more genetically 
distant to wheat than other species used. Effects by DNA types have been also demonstrated by 
Lukens et al. [15] with the allopolyploid Brassica. Their results indicate no variation for SSR, low 
frequent variations for restriction fragment length polymorphism (RFLP), but high frequent variations 
for DNA methylation. The methods used to assess DNA variation based on methyl-sensitive or 
nonsensitive restriction enzymes also revealed a lot of variations in allopolyploid wheat [11-13]. 
However, SSR sequences are highly conserved in allohexaploid wheat [25]. 

The different conclusions from different DNA types may be finally attributed to different mutation 
mechanisms. Ellegren [1] proposed two mechanisms for SSR mutation: DNA polymerase slippage and 
unequal recombination. The former usually leads to a single novel allele and the latter should result in 
co-existence of alleles of different lengths. It seemed that the mutant Xwmc312 allele observed in this 
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study might result from DNA polymerase slippage, although the unequal recombination model could 
not be completely excluded [29]. The highly conserved SSRs during allopolyploidization of wheat 
may indicate that enzymes for replication slippage are conserved among A, B, and D genomes of its 
parent T. turgidum and Ae. tauschii. Comparing these enzymes among different species and their 
relationships with SSR mutation rates are necessary to further test this hypothesis. 

3. Experimental Section 

3.1. Plant Materials 

Plant materials used in this study included Triticum turgidum ssp. durum line Langdon (abbreviated 
as LDN), ssp. dicoccon line PI377655, ssp. turgidum lines AS313 and AS2255, Aegilops tauschii 
accessions AS60, AS66 and AS87, and their synthesizing double haploid populations SynDHl, 
SynDH2, and SynDH3 (Table 1). These SynDHs were obtained by spontaneous chromosome doubling 
with the help of unreduced gametes [24,30]. Tetraploid Fi hybrids of LDN x AS313 and 
LDN x AS2255 were pollinated by Ae. tauschii AS60 to form triploid Fi hybrids with genomes ABD, 
respectively. The Si seeds for SynDHl and SynDH2 populations were obtained by selfmg the 
LDN/AS313//AS60 and LDN/AS2255//AS60 triploid Fi hybrid plants, respectively [24]. Similarly, Si 
seeds for SynDH3 population were obtained from the triploid Fis between AS66 x AS87 and 
PI377655 (Figure 1). All the triploid Fi hybrids for the three populations spontaneously produced Si 
doubled haploid seeds since all the three T. turgidum parents have genes for the formation of 
unreduced gametes in haploid status [24,30]. Si seeds from a single triploid Fi plant formed a SynDH 
line. Only A- and B-genome chromosomes between T. turgidum lines were involved in genetic 
recombination for SynDHl and SynDH2 populations [24], while only D-genome chromosomes 
between Ae. tauschii lines were involved in recombination for SynDH3 (Figure 1). No embryo rescue 
technique or hormone treatment was applied for the production of triploid Fi hybrids. Five Si plants 
were grown for each SynDH line in order to obtain a bulked leaf sample for genomic DNA isolation. 

3.2. Simple Sequence Repeats (SSRs) Analysis 

Leaf samples were collected at heading stage from parents and the SynDH lines, ground in liquid 
nitrogen and used for DNA isolation with a modified 2 x CTAB method [31]. For each SynDH line, 
total genomic DNA was isolated from a leaf sample bulked from five Si plants [24], which represented 
independent chromosome doubling events that are expected to be genetically identical. 

Eight-one polymorphic SSR markers between the two T. turgidum parents of LDN and AS313 were 
used in investigation of SynDHl population (Table SI). Of them, 12 on chromosome 3B or 6B that 
were polymorphic between the parents LDN and AS2255 were also used in the study of SynDH2 
population (Table SI). These SSR markers have been mapped on A- or B-genome chromosome, 
respectively. Seventy-nine markers from D genome chromosomes were used in analyzing SynDH3 
population (Table SI). They were polymorphic between Ae. tauschii parents AS66 and AS87 and, 
respectively, represent each D-genome chromosome [32-37]. PCR reactions were performed as 
previously described [24]. All of the SynDH samples were run together with their parent DNA as the 
controls. The amplified fragments were separated by electrophoresis in 6% polyacrylamide denaturing 
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gels (8 M urea, 35-cm-wide x 29-cm-long) and visualized with silver-staining method [31]. Run the 
gels in lx TBE buffer at 75 W and 1600 V constant power until the indicator bromophenol blue 
reached the bottom. 

For the survey of SSR mutations, amplification patterns observed in the SynDH lines were 
compared with those of their respective parents. The presence of a SSR fragment in a parent that was 
absent from the corresponding SynDH line was recognized as a loss. The presence of a SSR fragment 
in a SynDH line that was not observed in the parents was recognized as a gain. Accordingly, a marker 
was considered as non-changed if its patterns in a SynDH line were a combination of the parents'. By 
contrast, a marker was considered as changed if an unexpected amplification fragment appeared in a 
SynDH line. All PCR reactions that generated unexpected changes of either a loss- or a gain-type or an 
unclear fragment were repeated for confirmation. 

3.3. Cloning and Sequencing 

Mutant alleles and their parent alleles were re-amplified, cloned, and sequenced to reconfirm the 
mutation and to determine its nature. The re-amplified PCR products were separated by 6% 
polyacrylamide denaturing gels. The target fragments were cut and boiled in 30 |iL ddH 2 0 for 10 min 
and then centrifuged at 4000x g for 10 min. The upper liquid was used to conduct the second PCR 
reaction and the PCR products were then separated on 2.5% agarose gels followed by purification 
using Gel DNA Recovery Kit (PUEX, USA.). The purified PCR products were cloned into pMD19-T 
vector using cloning kit from TaKaRa Biotechnology (Dalian, China) Co., Ltd. [38]. Positive clones 
were identified and then sequenced by BGI (Beijing, China). Sequence alignments were conducted 
using the DNAMAN 6.0 Demo software (Lynnon Biosoft). 

3.4. Estimation of SSR Mutation Rate 

In this study, a SSR amplification pattern from a parent was estimated as an SSR allele. This 
estimation was supported by our mapping results that most of the 160 markers used in this study were 
mapped as a single locus on a chromosome based on their genotyping using the three SynDH 
populations (Table SI). Based on the observed amplification pattern in SynDHs, the scored SSR 
markers were further classified as two types: (A) amplifying SSR products at two SSR alleles in a 
SynDH line, one from T. turgidum and the other from Ae. tauschii', (B) only amplifying SSR products 
at one of the two SSR alleles from T. turgidum andAe. tauschii. 

The mutation rate was calculated by dividing the allele number of observed mutants by the allele 
number (n) of total possible mutation events, which were calculated by summarizing all the alleles 
from all the plants in a SynDH population. Depending upon the nature of a SSR marker, an Si plant 
with AABBDD genome composition could reveal two homologous alleles from T. turgidum and two 
from Ae. tauschii or only reveal two homologous alleles from one of the parents. Meanwhile, a bulked 
DNA of five Si plants for each SynDH line was used. Therefore, for a Si population, the n was 
estimated by the following equation: 

n = (4A + 2B) xNx 5 

where: 
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A: number of SSR markers that revealed four alleles 
B: number of SSR markers that revealed two alleles 
TV: number of Si lines. 

4. Conclusions 

In this study, mutation events were studied at 160 SSR loci in Si generation of the three SynDH 
populations. Of the 148260 SSR alleles investigated in Si generation, only one mutation (changed 
number of repeats) was confirmed with a mutation rate of 6.74 x 10~ 6 . This SSR mutation was 
homozygous in the analyzed Si plants of line SynDH 1-61. Therefore, this mutation most likely 
occurred in the respective Fi hybrid and then homologized in the Si via the union of unreduced 
gametes. Compared with previously reported data as regards durum wheat or common wheat itself, our 
results suggested that allohexaploidization of wheat did not increase the SSR mutation rate. 
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